How to Implement a Time-Free Perfect Failure Detector in Partially Synchronous Systems
Abstract
This paper introduces our partially synchronous Θ-Model, which is essentially the well-known FLP model augmented with a bound Θ on the ratio between the maximum and minimum end-to-end delays of messages simultaneously in transit between correct processes. We show that the Θ-Model admits the implementation of a perfect failure detector, which makes it possible to employ classic solutions to important distributed computing problems such as consensus. Since the bound Θ may still hold when an assumed upper bound on the maximum delay is violated, those solutions work correctly in scenarios where synchronous implementations would fail. We show that our algorithm, which employs heartbeat messages and a timer-free timeout mechanism based upon synchronized heartbeat rounds, indeed satisfies the properties of a perfect failure detector and provides a number of attractive additional features.
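To make the idea of a timer-free timeout concrete, here is a minimal Python sketch. It is not the algorithm from the paper. It only illustrates the general pattern of suspecting a process by counting heartbeat rounds rather than reading a clock. The class name `RoundBasedDetector`, the threshold `xi` (assumed here to be some integer derived from Θ), and the round-synchronization layer that calls `advance_round` are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class RoundBasedDetector:
    """Suspects peers by counting heartbeat rounds instead of reading a clock."""

    peers: set[str]
    xi: int  # hypothetical lag threshold in rounds, assumed to be derived from Theta
    local_round: int = 0
    last_seen: dict[str, int] = field(default_factory=dict)
    suspected: set[str] = field(default_factory=set)

    def on_heartbeat(self, sender: str, round_no: int) -> None:
        # Record the highest heartbeat round received from each peer.
        if round_no > self.last_seen.get(sender, -1):
            self.last_seen[sender] = round_no

    def advance_round(self) -> set[str]:
        # Called by an assumed round-synchronization layer (not shown)
        # whenever this process starts a new heartbeat round.
        self.local_round += 1
        for peer in self.peers:
            lag = self.local_round - self.last_seen.get(peer, 0)
            if lag > self.xi:
                # Crashes are permanent in this sketch, so suspicion is never revoked.
                self.suspected.add(peer)
        return set(self.suspected)


fd = RoundBasedDetector(peers={"q", "r"}, xi=2)
for k in range(1, 6):
    fd.on_heartbeat("q", k)  # q keeps sending heartbeats; r stays silent
    suspects = fd.advance_round()
print(suspects)  # {'r'}
```

The timeout is expressed purely in rounds: a peer is suspected once it lags more than `xi` rounds behind the local round. Whether such a rule never suspects a correct process depends on how `xi` relates to Θ and on how rounds are synchronized, which this sketch does not model.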
Similar resources
Implementing the Weakest Failure Detector for Solving Consensus
The concept of unreliable failure detector was introduced by Chandra and Toueg as a mechanism that provides information about process failures. This mechanism has been used to solve several agreement problems, like Consensus. In this paper, algorithms that implement failure detectors in partially synchronous systems are presented. First two simple algorithms of the weakest class to solve Consen...
Failure Detection with Booting in Partially Synchronous Systems
Unreliable failure detectors are a well-known means to enrich asynchronous distributed systems with time-free semantics that allow consensus to be solved in the presence of crash failures. Implementing unreliable failure detectors requires a system that provides some synchrony, typically an upper bound on end-to-end message delays. Recently, we introduced an implementation of the perfect failure de...
Efficient Algorithms to Implement Unreliable Failure Detectors in Partially Synchronous Systems
Unreliable failure detectors, proposed by Chandra and Toueg [2], are mechanisms that provide information about process failures. In [2], eight classes of failure detectors were defined, depending on how accurate this information is, and an algorithm implementing a failure detector of one of these classes in a partially synchronous system was presented. This algorithm is based on all-to-all comm...
Perfect Failure Detection in Timed Asynchronous Systems
Perfect failure detectors can correctly decide whether a computer has crashed. However, it is impossible to implement a perfect failure detector in purely asynchronous systems. We show how to enforce perfect failure detection in timed asynchronous systems with hardware watchdogs. The two main system model assumptions are (1) each computer can measure time intervals with a known maximum error, an...
On the Implementation of Unreliable Failure Detectors in Partially Synchronous Systems
Unreliable failure detectors were proposed by Chandra and Toueg as mechanisms that provide information about process failures. Chandra and Toueg defined eight classes of failure detectors, depending on how accurate this information is, and presented an algorithm implementing a failure detector of one of these classes in a partially synchronous system. This algorithm is based on all-to-all commu...
Publication date: 2008